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Abstract 

We prove existence and uniqueness results for solutions to a class of op- 
timal transportation problems with infinitely many marginals, supported 
on the real line. We also provide a characterization of the solution with 
an explicit formula. We then show that this result implies an infinite di- 
mensional rearrangement inequality, and use it to derive upper bounds 
on solutions to parabolic PDE via the Feynman-Kac formula. This result 
can be used to provide model independent bounds on the prices of certain 
derivatives in mathematical finance. As an another application, we obtain 
refined phase-space bounds in quantum physics. 

1 Introduction 

In this paper we introduce and initiate the investigation of a novel problem - 
an optimal transportation problem of Monge Kantorovich type with infinitely 
many marginals. Given measures (or marginals) \xt on R, for all t G [0,1], 
and a strictly convex function h : K — > R, we look for a stochastic process X t 
with continuous sample paths, such that lawA t = /i t for all t € [0,1], which 
maximizes 

E(h( f X t dt)) (MK^ 



Our motivation in studying (MKco) comes partially from an analogy with 
more familiar problems with finitely many marginals. The (two marginal) op- 
timal transportation problem of Monge and Kantorovich is to couple a pair of 
probability measures together as efficiently as possible. This problem has many 
deep connections to a variety of areas of mathematics, such as geometry and 
PDE, and has found applications in many diverse fields, including economics, 
fluid mechanics, meteorology, and image processing [34] [35] . Given two Borel 



*The author was supported in part by a University of Alberta start-up grant. 
^Department of Mathematical and Statistical Sciences, 632 CAB, University of Alberta, 
Edmonton, Alberta, Canada, T6G 2G1 pass@ualberta.ca. 



1 



probability measures fiQ and fj>i on smooth manifolds Mq and Mi, respectively, 
and a surplus function b : Mq x Mi — » R, the problem consists of finding the 
maximizer of the functional: 



6(x ,xi)d7 



(MK 2 ) 



M xMi 



among measures 7 on Mq x Mi which project to hq and /xi under the canon- 
ical projections. Under mild conditions, a solution 7 exists. Under somewhat 
stronger conditions, one can show that this solution is unique and is concen- 
trated on the graph {x\ — T(xq)} of a function over Xq; see Brenier [2], Gangbo 
and McCann QJ], Gangbo [IT], Caffarelli @], McCann [23], and Levin [JJ. 
In this case, we say that T solves Monge's optimal transportation problem. 
Let us note that the class of cost functions satisfying this condition includes 
c(a:o,a;i) = h(xo + Xi), where h is strictly convex. 

An interesting extension of (MK2) is the problem of aligning several distri- 
butions with maximal efficiency. That is, given m Borel probability measures 
fii, for i — 1,2,...., to on smooth manifolds Mi, M%, M m , respectively, and a 
surplus function b : Mi x M2 x ... x M m — > R, maximizing 



among measures 7 on Mi x Mi x ... x M m which project to the /i^ under the 
canonical projections, for i = 1,2, to. 

For our purposes, it will be useful to have an equivalent formulation of this 
problem using more probabilistic language. From this perspective, we look for 
the Mo x Mi x ... x M m valued random variable (Xi, X 2 , A m ), such that 
law(A^) = [ii for all i = 1,2, ...,to, which maximizes the expectation: 



As in the m = 2 case, it is straightforward to prove the existence of a 
maximizer. The literature on (MK m ), however, is much more limited than that 
on (MKz)- The case that is most relevant to us here is when Mi = W 1 for all 
i = 1, 2, m and c(xi, x 2 , x m ) — h(^2 i=l Xi), for a strictly convex function 
h, this problem has been studied by Gangbo and Swiech [13] and Heinich [16] . 
using a duality result of Kellerer [17]; see also [24] [18] [30] and [31]. In this case, 
they proved that the optimal measure 7 is unique and that it is concentrated on 
the graph {x\, fiixi), ■■■■fm(x m )} of a function over the first marginal, inducing 
what is often called a Monge solution. Since then, a handful of additional 
results haven been obtained for other surplus functions, by Carlier [5], Carlicr 
and Nazaret [6], and the present author [28] | 27 ) . 

It is natural to ask about the limit as the number of marginals, to, goes to 
infinity. Here, instead of prescribing to marginals, we prescribe a continuum of 
marginals /if on K", for t e [0, 1], and rather than looking for a random variable 
(Ai, A m ), we search for random curves X t , (ie, stochastic processes) that 
satisfy law(A t ) = [i t for each t e [0, 1]. Heuristically, replacing the cost function 




(MK m ) 



£(c(Ai,A 2 ,...,A m )) 



(MK m ) 
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h(J2iLi x i) with h(L Xtdt), we obtain problem (MK^) as the limit of (MK m ) 
as m — > oo. 

In this paper, we study (MKoa) but restrict our attention to the case n = 1. 
The machinery developed here does not generalize easily to the higher dimen- 
sional case; however, in a subsequent paper, we address the higher dimensional 
case for the quadratic surplus (that is, h(-) = | • | 2 ), using very different tech- 
niques 25 . In the multi- marginal setting, when n = 1, Carlier [3] showed that, 
for a class of cost functions including strictly convex functions of the sum, the 
optimal maps fi :M-)l,i = 2,3, m are all increasing. Our main result here 
(Theorem 13. l.ip is an infinite marginal analogue; there exists a unique optimal 
stochastic process X t and it is concentrated on an increasing family of curves. 

When h(x) = e x , the reader may recognize expressions of form (MK^) from 
the Feynman-Kac representation of solutions to parabolic PDE (the Feynman- 
Kac expression is actually somewhat more complicated, but see the extension 
in subsection 3.2). Our main result implies a stochastic rearrangement inequal- 
ity (Theorem 14.1.11) ; for any stochastic process, X t , E (h( X t dt)^j can be 

bounded above by the same expression with the process X t replaced by a partic- 
ularly simple process. Therefore, this result can be used to derive upper bounds 
on the solutions to parabolic PDE which are straightforward to compute. Let 
us briefly mention that upper bounds on expressions of the form (MK^) can 
be also be found by Jensen's inequality, but that our bound is always tighter 
than the Jensen's inequality bound. We discuss this in more detail in section 4. 

Parabolic PDE arise frequently in various applied contexts, including mathe- 
matical finance and mathematical physics among many others. In mathematical 
finance, for example, the price of a zero coupon bond maturing in one year is 
given by E[e~~ $° x * dt ]; where the stochastic process X t represents (stochastic) 
interest rate, given by some model. As another example, the value of an Asian 
call option on a stock maturing in one year is E(h(J^ X t dt)), where X t is the 
value of the stock price and h(x) = max(0, x — k) for some fixed k (the strike 
price) is piecewise linear. In some of these models, the expectation itself may be 
difficult to compute analytically, but the single time marginals can be readily 
determined and results in section 4 can be used to determine an upper bound 
on the interest rate simply by doing two 1-dimensional integrals. 

Let us add that, from a financial perspective, the potential interest in these 
inequalities does not only lie in estimating integrals that are difficult to com- 
pute; it can also be used provide information about certain phenomena that is 
less model dependent than standard financial methods. Model independence in 
finance has recently seen quite a bit of attention. The goal here is to under- 
stand the difference between derivative prices arising from models with the same 
marginals, as is outlined in pQ, [15] and [10]. Note that in these papers, while 
the authors restrict to discrete time models, they also impose the condition that 
the process X t should be a martingale, which is natural in a financial context. 
Our optimal process, X t for (MKoo), is not a martingale; nevertheless, it does 
provide an upper bound on the prices of appropriate derivatives arising from 
any model yielding the same marginals, and so may be relevant in their context. 
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Expressions of the form E i^h(J^X t dt)J also arise frequently in mathemat- 
ical physics. In this context, upper bounds on these expressions can be used 
to derive phase space bounds in quantum mechanics. The best known example 
is perhaps Lieb's proof of the Cwickel-Lieb-Rosenbljum (CLR) bound, which 
itself is a special case of Lieb-Thirring inequalities, and can be used to derive 
the more general case [21j . The Lieb-Thirring inequalities are well known for 
their crucial role in Lieb and Thirring's famous proof of the stability of matter 
|22] . The general process for obtaining phase-space bounds is outlined nicely in 
the book by Simon [3]; very briefly, the key idea is to apply Jensen's inequal- 
ity to expressions of the form (MK^), then use Fubini's theorem to rearrange 
the upper bound into a desirable form. We show that our result here provides 
tighter bounds on these quantities. 

In the next section, we outline our main assumptions and make our basic def- 
initions. In section 3, we state and prove our main result. In the fourth section, 
we use our main result to derive a rearrangement inequality for stochastic pro- 
cesses, which can then be used to derive upper bounds on solutions to parabolic 
PDE via the Feynman-Kac formulas. In the final section, we describe a specific 
application of this inequality: new phase space bounds in quantum physics. 



2 Preliminaries and assumptions 

In this section, we outline our assumptions and preliminary definitions. Many 
of these assumptions can be relaxed, at the cost of more complicated proofs; in 
subsection 3.2, for example, we discuss two extensions of this theorem, including 
a relaxation of the first assumption. 

Let II be the set of all real valued, measurable stochastic processes X t with 
continuous sample paths such that \a,wX t = fit for all t £ [0, 1]. We will assume: 

1. The function h : M — > K is strictly convex and C 2 smooth. 

2. Each fi t is absolutely continuous with respect to Lebesgue measure. 

3. The support of each fi t is a (possibly unbounded) interval: spt^ t = [a t , b t ], 
where a t may be — oo and b t may be oo. 

4. The path of measures 1 1— > fit is weakly continuous. 

5. The first moments and h moments of fit exist and are integrable in t. 
That is, 

/ | / xdfit (x) | dt < oo , / | / h{x)dfit{x) \dt < oo. 
Jo Jr Jo Jr 



Note that, for any process X t € II, applying Jensen's inequality twice and 
using Fubini's theorem yields: 
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h([ [ xdfj, t (x)dt) = h( ( E(X t )dt) = h(E( [ X t dt)) 



E(h{ f 
Jo 



< E(h( X t dt)) 



• i 



< E{ h{X t )dt)= / E(h(X t ))dt= / / h(x)d[M(x)dt 



o 

Therefore, assumption 5 ensures that E{h{Q X t dt)) is finite for each com- 
petitor x t g n. 

Our last assumption requires a little more notation, which we develop now. 
For each A e (0, 1), define the curve 

x x (t)=F t -\\) 

where Ff 1 is the inverse of the cumulative distribution function F t of \i t . Our 
final assumption can be interpreted as a regularity condition on the curve of 
measures fit , expressed in terms of the curves x\ (t) : 

6. The Riemann sums J2f =1 x\(jj>) converge to x\(t)dt as N — > oo, uni- 
formly in A on compact subsets of (0, 1). 

Note that weak continuity of the family /i t (assumption 4) implies that the 
curve t — > x\(t) is continuous and hence Riemann integrable. The Riemann 
sums above then converge to the integral, but the convergence is not necessarily 
uniform in A. 

Now, we define a particular stochastic process in n, which, as we will prove 
in the next section, is actually optimal for {MK^). 

Our underlying probability space will be the interval (0, 1), with Lebesgue 
measure. Define the process 

X? pt = X?' t (\)=x x (t), for AG [0,1]. (1) 

That is, the sample paths are the curves t n- x\ (t) , distributed uniformly in A. 
Note that these sample paths are increasing in A: whenever A > A, x\(t) >> 
Xj(t) for all t. In addition, it is clear from the construction that lawX t op * = fi t - 
Next, we define some functions which will be useful in the proof of our main 
result. For each t € [0, 1], we define a function g t : M — >• R by 

g t (x) = h'( / x x (r)dr), 
Jo 

where A = Ht{{— oo,x)) = F t (x). We then set 

Gt{x) = [ 9t{y)dy 

Jxi(t) 



Jo Jr 



5 



As h is convex, x H> h'(x) is increasing. As x i-> i 7 *^) = A and A H> J Q x\(t)dt 
are also increasing, it follows that x i— > <?t(x) is an increasing function of a; and 
son-> Gt(a;) is convex. 

Finally, as the function A i— > J* x\(t)dt is monotone, its (possibly infinite) 
limit as A increases to 1 exists: denote I ma i = lim^i- f Q x\(t)dt. Similarly, 
define I min = lirn A _>. 0+ f Q x\(t)dt. 

3 Main Result 

3.1 Statement and proof of main result 

Our main result is the following. 

Theorem 3.1.1. The process X° pt defined in equation {!]) is the unique maxi- 
mizer in (MK^xf). 

The proof requires several Lemmas. 

Lemma 3.1.2. For any process X t £ II, E(f^ G t (X t )dt) = f Q J„Gt(x)d^t(x)dt 

Proof. The result is a simple application of Fubini's theorem: 

E(f G t (X t )dt) = f E(G t (X t ))dt 

i 

Gt{x)d^t{x)dt 

□ 



o 



Lemma 3.1.3. For each A €E (0, 1), the curve X\{f) maximizes the functional 

F(x (•)) =h{[ x(t)dt) - f G t (x(t))dt (2) 
Jo Jo 

among all continuous curves x(-) on [0,1], with x(t) € (a t ,b t ) for all t. 

Proof. Our strategy will be to approximate these integrals by Riemann sums, 
and exploit known results on the corresponding optimal transportation problem 
with finitely many marginals. For a positive integer N, set U = 4r, for i = 
1,2, ...,N and consider the A^-marginal optimal transportation problem with 
marginals /i^ and surplus function 

J~] N x 
b(x tl ,xt 2 ,...,xt N ) = h{- 



N ' 

Note that this surplus function satisfies the strictly two monotone condition in 
[5] (essentially that q x ® g Xt > for all i ^ j); by a result of Carlier [5 (extended 
to unbounded domains in 26 J, the solution is concentrated on the set 

{(xx(t 1 ) 1 x x {t 2 ),...,x x (t N )) : AG (0,1)}. 
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It is well known that there exists a solution (tfi^ , 4>ti, 4>^ N ) to the Kantorovich 
dual problem [17] . These functions satisfy: 



with equality on the set {(x\(ti), xxfa), X\{tjs[)) : A e (0,1)}. 

Furthermore, due to the convexity of /i, the functions cj>^ are all convex, and 
due to assumptions 2 and 3, they are differentiable and satisfy the first order 
condition 

for all A £ (0,1). 

Up to an additive constant, we can assume that each </>^ (xi(ti)) = 
Now, let t — > x(t) be any continuous curve such that x(t) £ (a t ,b t ) for all 
t £ [0, 1]. Then we have, for any A: 



K u ^ {i) ) -£<(*(*)) ^ fct ^nr 1 ) - £ <o»*(*0) m 

For each s £ [0, 1], set \ s — F s (x(s)). We then have A s e (0, 1) and iea s (s) = 
.x(s). By assumptions 2 and 4, the map s t— > X s is continuous. Therefore, there 
exists some < c < d < 1 such that c < A s < d for all < s < 1. 

Therefore, by the monotonicity of A i— > x\(t), we have x c (t) < x\ 3 (t) < Xd{t), 
and so, for all s £ [0, 1], 

J c := / x c (t)dt< [ x Xs (t)dt< [ x d (t)dt=:I d 
Jo Jo Jo 

Therefore, s i-> x> s is bounded above and below; choose K such that 
| h' (x) | < K whenever I c — 1 < £ < Id + 1 

Now, for each point xt t <E (at^&tj such that A := Ft^a;^) € [c, d], we have 

iC'fe,) - = ±\h'( T -<"** lt ' > )-h>(j\ x (t)dt)\ (4) 

Choose e > 0. By uniform convergence of the Riemann sums of x\(t) (as- 
sumption 6), we can choose N so large that we have 



x x {t)dt\ < e (6) 



lE^i^tt.) 



N 



for all A <E [c, d] . 
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Now, the continuous function t H> x(t) — xi (t) is bounded on [0, 1]: 

\x(t)-xi(t)\<M, (7) 

and so integrating 0^/(x ti ) and gti(%u) from xi(ti) to x(ti) and combining 
inequalities ([5]), ©, and (O yields 

\<%{x(U))-^G t MU))\= < \<t>Z'{x)-^ 9ti {x)\dx 

J X l iti) 

< KM—e 

N 

We then have 

N N 
i=l i=l 

Now, the term J2iLi JfG ti (x ti ) converges to J Q G t {x{t))dt as N — > oo and 

so the above inequality implies that J^Li ^ui^h) converges to J Q Gt(x(t))dt 

as well. Similarly, ^ (ai.\(ii)) converges to J* G t (x\(t))dt. As h( ^-' i = ] ^ Xti ) 

converges to h(J^ =0 x{t)dt) and x a(A)) converges to h(J. Q x\(t)dt) the 

result now follows by taking the limit of ([3]). 

□ 

Lemma 3.1.4. For each A S (0, 1), the curve x\(-) maximizes the functional 

J r (i(-)) = h( f x{t)dt) - f G t {x(t))dt (8) 
Jo Jo 

among all continuous curves x(t) on [0, 1], with x(t) £ spt(/it) for all t. 

Proof. By the preceding lemma, we only need to consider the case where x(t) = 
b t or a t for some t. We will assume here that x(t) — b t for some t, but a t < x(t) 
for all t (the proofs of the other cases are similar). 

First, suppose that \im x ^ I+ h'{x) > 0. This implies that linx^^ gt{x) > 
for each t. Then, for e > 0, consider the set 

A e = {t: x(t) - e G (a tl b t )} n {t : g t (x(t) - e) > 0} 

As an open subset of [0, 1], A e can be written as a countable union of disjoint 
intervals, A e — Dili. For each i, let qi(t) : Ii — > K be a continuous function such 
that qi(t) = on the boundary of Ii and < < 1 on the interior of 7,. We 
define the function 

x e (t) = x(t) — eqi(t), if t € Jj for some z 
ai e (i) = ai(t), otherwise. 



8 



Note that x e (t) < x(t) everywhere, and the inequality is strict for t £ A e . As 
x h-> gt(x) is monotone increasing and gt(x(t) — e) > on A e , we have gt(x) > 
for x(t) - e < x < x(t). Thus, ^r(x) = g t (x) > 0, which implies G t (x e (t)) 
increases as e decreases. Furthermore, |i € (i) — x(t)\ < e, so x e (t) — > x(t) as 
e ->■ 0. By continuity, G t (x c (i)) -> G t (x(f)). 

As the points where i t = & t all lie in A e , we have a t < x e (t) < b t everywhere. 
Therefore, Lemma T3. 1.31 

h([ * e (t))- [ G t (x e (t))dt < h( [ x x (t))- [ G t {x x {t))dt 
Jo Jo Jo Jo 

Taking the limit as e — > 0, using the monotone convergence theorem on 
Jq 1 x e (t)dt and G t (x e (t))dt and the continuity of h yields the desired result. 

Finally, if lim j+ h!(x) < 0, then gt(x) < for all x £ (a t ,b t ). A similar 
argument to the above implies that e H >• Gt{x e (t)) as e decreases and applying 
the monotone convergence theorem again yields the desired result. 

□ 

Lemma 3.1.5. The curves x\(t) are the only maximizers of the functional 
J 7 (i(-)) in Lemma \3.1.4\ 

Proof. To show that no other continuous curve x can be a maximizer, we use a 
variational argument. Assume that x(-) is a curve such that x(t) £ spt(fj, t ) for 
all t, maximizing ([8]) . Let y(t) be a continuous function such that x(t) +ey(t)) £ 
spt(fii) for sufficiently small e > 0. We then have: 



> 



dF(x(t) + ey{t)) 



de+ 



d 
de~+ 



£=0 
1 



h\ I x{t)+ey(t))dt- { G t (x(t) + ey(t))dt 



ii 



h'(j x(t)dt) j y{t)dt- j g t (x(t))y(t)dt 

h'^J i(r)dr) - g t {x{t))\y{t)dt 

Note that we have used Lebesgue's dominated convergence theorem on the last 
term. 

Now, choose A such that J x\(s)ds — J x(s)ds; we will show that x\(t) = 
x(t) for all t. The proof is by contradiction; assume x\(t) ^ x(t) for some fixed 
t. Suppose x\(t) > x(t) (the case x\(t) < x(t) is similar and is omitted). Then 
by continuity, x\(s) > x(s) in some neighbourhood U of t. By monotonicity of 
x i-> g s (x), this implies that g s (x\(s)) > g s (x(s)) in U 

Now, as x(s) < x\(s) < b s on U, we may choose y(s) > on U and y(s) = 



(9) 
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elsewhere and have x(s) + ey(s) G spt(fi s ) everywhere. Then we have, for s € U 



h'y x{r)dr) - g s {x(s)) y(s) = h'yj x x (r)dr) - g s {x(s)) y{s) 



> 



h'( / x\(r)dr ) - g s (x\(s)) y(s) 



h'i^j x\(r)drj - h'yj x\(r)drj y{s) 



As y(s) = outside of U, this yields 



h'{^ j x(r)drj — g s (x(s)) y(s)ds > 



This contradicts inequality ([9]), completing the proof. 



We are now in a position to prove Theorem 13 .1.1 1 



□ 



Proof. It is clear from the definition that X° pt € II; that is, that lawX t opt = /if. 
Now, by Lemma 13.1.41 given any process Yj G II with single time marginals /it, 
we have, for each individual sample path and each A, 



//(/ Y t dt) < h([ x x (t))- f G t (x x (t))dt + [ G t {Y t )dt) 
Jo Jo Jo 

= M+ f G t {Y t )dt 



where M denotes the minimal value of the functional T in Lemma 13.1.41 Note 
that we have equality if and only if the sample path Yt = X\(t) for some A, by 
Lemma 13.1.51 Taking expectations and using Lemma |3. 1 . 21 yields 

E(h([ Y t dt)) < M + E{[ G t (Y t )dt) 



= M+ / G t {x)dnt{x)dt. 

JO JR 

and we have equality if and only if Y t = x\(t) for some A, almost surely. Now, if 
we interpret a stochastic process as a measure on the set of continuous functions, 
it is clear that only one measure with marginals fi t can be supported on the set 
{x\(-) : A € [0, 1]}; namely, the optimizer X° pt . This completes the proof. □ 
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3.2 Extensions of the main theorem 

In this subsection we present two extensions of our main theorem, designed 
specifically with potential applications in mind. 

The first extension is a relaxation of the smoothness and strict convexity of ft. 
One motivation for this result is to find an upper bound on the price of an Asian 
option, whose value is given by E[h(J^ X t dt)], for a particular process X t with 
h(x) = max(0, ax — b) (ie, a non smooth, convex but not strictly convex surplus 
function). In this case, we potentially lose uniqueness in our main theorem. 

Theorem 3.2.1. Suppose that ft is convex. Assume conditions 2-6, and in 
addition assume that the second moments of \it exist and are integrable, that is, 
f R x 2 dfi t (x)dt < oo. Then the process X° pt defined in section 2 is optimal 
foriMK^). 

Proof. Let h n be a sequence of smooth, strictly convex functions such that 
hn{x) > h n+ i(x), h n (x) — > h{x) pointwise and 

ft (as) < h n {x) < h(x) + 1 + x 2 , 

Then ft„ satisfies assumptions 1 and 5, and therefore, for any Y t € II we have, 
by Theorem |3XT] 

E ( h n{J Q Y t dt))<E(h n (^ X° t pt dt)^ 

Now, by the monotone convergence theorem, £J^ft n ( f£ Ytd£) \ converges to 

E(h( Jq Y t dt)y and similarly E(h n (J^ X° pt dt)^j converges to E(h( X° pt dt)}, 
implying the desired result. □ 

Our second extension concerns coupling a stochastic process Xt with pre- 
scribed single time marginals and a real valued random variable Z with a pre- 
scribed law, to maximize E[h(J^ X t dt)Z\. The motivation here is the Feynman- 
Kac formula; expressions of this form represent solutions to parabolic PDE and 
in the next section, we use our extension here to prove upper bounds on these 
solutions. Our extension here holds when Z is positive almost surely. If Z is 
merely non-negative almost surely, the process we construct is still optimal, but 
we possibly lose uniqueness. 

Let [i t be a curve of measures, satisfying the conditions in the second section. 
Let v be a measure on the upper half line K + = {y > 0}, absolutely continuous 
with respect to Lebesgue measure. Then consider the problem of finding a pair 
(X t , Z), where X t is a continuous stochastic process, with \&wX t = [i t an d Z a 
random variable with lawZ = v, maximizing 
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Recall the stochastic process X° p from the section 2 and define the random 
variable Z opt = Z opt (X) on the same underlying probability space [0, 1] implicitly 
by 

v(-oo,Z°#(\)) = A 

Then the pair (X° pt , Z opt ) satisfies lawX° p * = and lawZ opt = v. We also 
define 

g(z) = h'( I x x (t)dt)z x , 
Jo 

where A = v{{— oo, Z\)). We then set 

G(z) = / 9t(y)dy 

J z l/2 



Theorem 3.2.2. Assume conditions 1-6 from section 2 and that the measure v 
on (0, oo) is absolutely continuous with respect to Lebesgue measure. Then the 
pair (X° pt , Z opt ) is the unique optimizer for (MK^+x). 

Proof. The proof is similar to the proof of the main theorem, and is only sketched 
here. The key adjustment is in Lemma r3.1.3[ where we replace /i(2i=i x u) with 
hQ^iLi xti )z, where z > 0. This surplus function is again strictly two monotone, 
and so the result of Carlier [S] applies to the N+ 1 marginal optimal transporta- 
tion problem with this surplus function and marginals fi^ , Att 2 , •■•Mtjv ! v - Letting 
cj)^, (f>^ 2 , ■■■4>t 1 N j u N solve the dual problem, we obtain, for any curve x(t), point 
z > and A € (0,1): 



~(+ \ N X^ N (+ \ N 

K u = lX[ti) )z-J2K(Hti))-u N (z) < h( ^=^ U) )z x -^^(u))-u N ( 

i=l i=l 

From the first order conditions arising in the N + 1 marginal problem, we have 
u N '(z) = V( E '=y A(tl) )z A , where A = v((-ac, z x )). Much like in the proof of 
Lemma r3.1.3l one can use this to show that as N — > oo, u N (Z) — > G(Z). As in 
the proof of Lemma 13.1.31 we can take the limit as N — > oo and obtain 

h( f x{t)dt)y- f G t {x{t))dt~u{y) <h{( x x {t)dt)y- f G t {x x {t))dt-u{y x ). 
Jo Jo Jo Jo 

Here, Gt := f^^gt(y)dy is defined as in section 2, except that we take 

2 

g t (x) — h'(J Q x x (r)dr)z x , for A = Ft(x), whereas in section 2 we took g{x) = 
h'(f x x (r)dr). 

The remainder of the proof is analogous to the proof of the main theorem 
and preceding lemmas in subsection 3.1 and is omitted. □ 
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4 Rearrangement inequalities and applications 



4.1 A rearrangement inequality 

Carlier observed that a solution to the multi-marginal problem in 1-dimension 
yields a refinement of the Hardy-Littlewood inequality [3] . Here, we use our main 
theorem to develop an analogous infinite dimensional rearrangement inequality 
of Hardy-Littlewood type. Given a stochastic process Y t , set fit =lawF t . We can 
then find the process X° pt corresponding to these marginals, as in section 2 (that 
is, a monotone rearrangement of Y t ) and, assuming the marginals and convex 
h satisfy assumptions 1-6, Theorems 13.1.11 and 13.2.21 yield the following result, 
which can be viewed as an infinite dimensional version of the Hardy-Littlewood 
inequality 

Proposition 4.1.1. Let Y t be a stochastic process whose marginals fit =law(Y t ) 
satisfy conditions 1-6. Let Z be a random variable on R + and assume v =law(Z) 
is absolutely continuous with respect to Lebesgue. Let (X° pt , Z opt ) be the opti- 
mal process corresponding to these marginals, defined in section 2 (X° pt ) and 
subsection 3.2 (Z°^). We then have 



E(h( f Ydt)) < E{h{ [ X° pt dt)) 
Jo Jo 



h( ( x x (t)dt)d\ 
o Jo 



E{h{ f Y t dt)Z) < E{h{ [ X opt dt)Z opt ) 
Jo Jo 
i ,i 

h( / x\(t)dt)z\d\ 
o Jo 

It is interesting to note that Jensen's inequality also yields upper bounds on 
quantities of the form E(h(J Y t dt)), and these upper bounds are often used in 
finance and physics (see the examples in the subsequent section). Recall that 
for each sample path Y t of a stochastic process and convex h, Jensen's inequality 
says: 



h{ I Y t dt) < [ h(Y t )dt 
Jo Jo 



so that, for any stochastic process Yt, we have: 

E(h{ f Y t dt)) <E([ h{Y t )dt) 



We now prove our bound (|4.1.1[) is tighter than the bound coming from 
Jensen's inequality. 
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Proposition 4.1.2. Given a stochastic process Y t , let X° pt be the optimal pro- 
cess from section 2, with marginals pit —law(Y t ). Then: 

E(h( f X° pt dt)) <E(f h(Y t )dt) 
Jo Jo 

Proof. We apply Jensen's inequality to E(h(J^ X° pt dt)), and use Fubini's the- 
orem on the resulting expression, yielding 

E(h( J V ° J " ' ' ■ r > I '■' v "" i 



X° pt dt)) < E(l h(X° pt dt)) 
o 

E{h{X° pt ))dt 







/ h(x)d[i t (x))dt 
Jm 

/ h(y)dtit(y))dt 

l 

E{h{Y t ))dt 

o 

= E([ h(Y t )dt) 
Jo 



where we have used the fact that the law(Yf) = \&w(X° pt ) —[it- □ 

Also of note is that, by applying Jensen's inequality to the expectation E 
rather than the integral J , we obtain a lower bound on E(h(J Q Yt)), which 
by Fubini's theorem, depends only on the marginals law(Yt). Combining our 
main theorem with Jensen's inequality in this way, then, gives upper and lower 
bounds for E(h(J^ Y t )) which any process Y t with marginals [it must satisfy. 

4.2 Example 

We illustrate the rearrangement inequality in Proposition ^. 1 . II with an example. 
In this example, the convex function h(x) = e x is the exponential function. 

Proposition 4.2.1. Let Wt be Brownian motion starting at Wq = and 

_ 2 

/it =law(Wt), so that d[it{x) — e J^ t dx. Then, for s > 0, 

E{eS5 Wtdt ) = 

E{\[e^dt) = A[e4-1] 
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After rescaling the interval [0,1] to [0,s], these three quantities represent, 
respectively, E(e^o w t dt ^ the upper bound from Proposition 14.1.11 (see the Re- 
mark below), and the upper bound from Jensen's inequality, respectively. It is 

a 3 2s 3 n s 3 

straightforward to verify directly that e~ < e~ < -4[e = — 1], as is implied 
by Propositions 14.1.11 and 14.1.21 

Remark 4.2.2. Strictly speaking, our main theorem does not apply here, as the 
marginal fiQ is a Dirac mass at x = and not absolutely continuous with respect 
to Lebesgue (the other conditions from section 2 are easy to verify). However, 
Lemma \3.1.3\ implies that for any e > and any curve x(t)we have 

h{J x t dt)- j gt{x t )dt<h{J^ x x {t)dt)- j g t (x x (t))dt 
Taking the limit as e — > yields: 

h(f x t )dt- [ g t (x t )dt<h([ x x {t)dt)- [ g t {x x {t))dt 



Jo Jo Jo Jo 

We can then prove, just as in Section 3, that for any process Y t € II, we 



l r i 

opt 



E{h{ / Y t dt)) < E{h{ / X° pt dt)) 
Jo Jo 

Proof. Let h be the exponential function. The curves x\(t) here satisfy 

/T.ft) -v 2 ,- x a W -y 2 
dy = / dy 
-oo v27rf J-oo v27T 

which implies x\(t) — y/iz\, where A =er{(z\). Therefore, J Q S x\(t)dt = f^s- 
Our expectation becomes: 



E(h(l X° pt dt)) = I e% z ^dX 

2 3 

e 3 zs2 e — dz 



n 





3 


Jo 






/' 00 




' — oo 








' — oo 


2 3 3 

e 9 




V2tt „ 


' — oo 


2., 3 





e s) 



-i[( Z -^l) 2 -44] dz 



e aV* 3 > dz 



Now, as Wt is Brownian motion, it is well known that Z s — Wtdt is 

3 

normally distributed, with mean zero and variance 4-. Therefore, we have that 
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2tts 3 



e z e 2^"c?z 



= P. 6 



2tts 3 



3(z-4,-) 2 

$—dz 



e 2 



= e e 

Finally, we compute the upper bound from Jensen's inequality. 



E(eK w ** t ) < E(- [ e sWt dt) 
s Jo 



E(e sWt )dt 



s poo 



1 -W 2 

-_e sW e—dWdt 



I lid 



i r j> t r°° i , 

eT" / -^=e 57 dWcZt 



e 2 ds 



2 ,3 



□ 



4.3 Upper bounds on solutions to Parabolic PDE 

Consider the partial differential equation on (x,t) £ K" +1 : 




= V(x,t)f 



with terminal condition f(x,T) = ip(x). Here, b = b(x,t) is an R n valued func- 
tion and A = A(x, t) an n x n matrix valued function. Equations of this type 
have numerous applications in mathematical finance, mathematical physics, bi- 
ology, etc. According to the Feynman-Kac formula, the solution is given by: 

f{$,s) = E[e-Ss y ^ dt ^{X T )] (10) 

where X t is the solution of the stochastic PDE 

dX = b(X, t)dt + A(X, t)dW, X s =x 

Here W is a Weiner process on M. n . Suppose that ip > 0. Letting [i t =law(V(Xt)) 
and v =la,w (iI>(Xt)), then, if assumptions 1-6 are satisfied, Proposition ^. 1 . ll tclls 
us that: 
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f(st,a) < E[e-^ x ° ptdt Z opt ] (11) 

= / e-^ x ^ t)dt z x d\ (12) 
Jo 

Depending on b, A and V, the Feynman-Kac formula may be very difficult 
to evaluate analytically. In practice, the integral (fTO]) is often computed numer- 
ically using Monte-Carlo techniques. In certain cases, however, the single time 
marginals of the real- valued process V(X t ) can be determined. To find an up- 
per bound on f(x, t), then, we must only compute the integral (|11[) . which boils 
down to computing two, 1-dimensional integral. Even if this integral cannot 
easily be calculated analytically, it is a much simpler task to compute numeri- 
cally than the infinite dimensional integral in (|10[) . In the next subsection, we 
present a particular application in quantum physics. 

5 Application: phase space bounds in quantum 
physics 

In this subsection, we illustrate how our result can be used to prove refinements 
of two phase-space bounds in quantum physics. A discussion of the physical 
significance of these bounds can be found in the book of Simon [3j. The basic 
idea behind their proofs is that, as outlined by Simon, phase space bounds can 
be derived by applying Jensen's inequality to quantities of the form (MKqo). 
Our general strategy here is to mimic these proofs, replacing Jensen's inequality 
with the result from Proposition 14.1.11 as a consequence of Proposition 14.1.21 
our phase space bounds here are tighter. 

Let V be a continuous, integrable, function on R™ which is bounded from 
below. For fixed t, let W* be a Brownian bridge process in s on R™, starting 
and ending at x = 0, at times s = and s = t, respectively; that is, Wq — 
and W\ = 0. 

Let h be the exponential function and, for each x £ R™, assume that the mea- 
sures fi s ,t,S,V =law[V(iG + W%)] satisfy conditions 1-60- Let Y^ t sv he defined 
by the implicit relationship 

A = ^ S!i ^y(-00, Y^j S y). 

Let Ho be — ^ A, where A is the Laplacian. 

The following classical bound on the trace of the operator e~ t ^ Ho+v ' > is due 
to Golden [TJ], Thompson [33] and Symanzik [32] : 

1 Note that, strictly speaking, the measures /i s t jy cannot satisfy assumption 2, as fi t g v 
and )it t S V are Dirac masses at V(x); however, this can be dealt with as in the example in 
subsection 4.4. 
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The proof of our bound below is based on Symanzik's proof of inequality ([IB"]) , 
outlined in the book by Simon. 

Theorem 5.0.1. 

Jw Jo 

Proof. We follow the argument in Simon, using our inequality from Theorem 
14.1. II in place of Jensen's inequality. 

As in Simon, the Feynman-Kac formula implies: 

Tr ( e -Wo+v ))= f E [e- K v ( s+W ^ ds ]dx. 
Jr 3 

Applying Proposition 14. 1.11 to E[e~ fo V{S+Wt)ds^ y- ie \^ s the desired result. □ 

A similar argument yields a bound on the number of bound states, that 
is, the number N(V) of negative eigenvalues of Hq. Using Jensen's inequality, 
this leads to Lieb's proof of the celebrated Cwickel-Lieb-Rosenbljum bound [21], 
which states that, for n > 3, N(V) is bounded from above by the L n / 2 norm of 
the negative part of V: 

N{V) < a [ \V-{x)\ n/2 dx. (14) 

Let us mention that the independent proofs of Cwikel [8] and Rosenbljum 
[29] rely on different techniques. Other proofs of the CLR bound have since 
been found by Li and Yau [20], Fefferman [9] and Conlon [7]. 

Applying our main theorem instead leads to the bound below. 

Theorem 5.0.2. For any convex function h satisfying 1-6, 

p oo p /* 1 rt 

N(V) < C(h) / r 1 h( Y s x t 3 v ds)d\dxdt. 

Jo Jr™ Jo Jo 

Here, C(h) is a constant depending on h. 

Proof. The argument of Lieb (again, reproduced by Simon) yields, for any con- 
vex function h on R 

N(V)<C(h)J^ r 1 J E(h( J v(W^)ds)^dxdt 
We apply our inequality to E(ji( J * V(Wg)ds) yielding the desired result. 

□ 

Remark 1. Let us note that the above holds for any convex function h. Lieb's 
proof goes on to apply Jensen's inequality to h(J Q V(W*)ds) and obtains |7^p 
with 

s- 1 - n ' 2 h{s)ds 
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If n > 3, if is possible to choose the function h so that both C(h) and the 
preceding integral are finite I21}/W\/. In principal, one can then optimize over 
convex functions h to obtain as good a constant a as possible (note, however, 
that the optimal constant is not known). Note, however, that for whichever 
function h one chooses, our bound above, using the same h, is tighter. 

It should be emphasized that, while our bounds appear complicated, they 
can be calculated for any convex function h simply by determining the single 
time marginals of V[x + W*] and calculating several (two in Theorem 15.0.11 and 
three in Theorem l5.0.2|) 1-dimensional integrals and one n-dimensional integral, 
which is relatively easy to do numerically even if the form of V makes it difficult 
analytically. On the other hand, our bounds improve the classical phase space 
bounds, as our inequality from Proposition 14. 1 . ll is always tighter than Jensen's 
inequality, by Proposition I4.1.2T 
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